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In this paper two scheduling models are addressed. First is the standard model (unicast) where re- 
quests (or jobs) are independent. The other is the broadcast model where broadcasting a page can satisfy 
multiple outstanding requests for that page. We consider online scheduling of requests when they have 
deadUnes. Unlike previous models, which mainly consider the objective of maximizing throughput while 
respecting deadlines, here we focus on scheduling all the given requests with the goal of minimizing the 
maximum delay factor. The delay factor of a schedule is defined to be the minimum a > 1 such that 
C^ ' each request i is completed by time a^ + a{di — Ui) where a^ is the arrival time of request i and di is its 

^*\ I deadline. Delay factor generalizes the previously defined measure of maximum stretch which is based 

only the processing times of requests IIBCM98I FBMR02I . 
Q I We prove strong lower bounds on the achievable competitive ratios for delay factor scheduling even 

with unit-time requests. Motivated by this, we then consider resource augmentation analysis IKPOOI and 
prove the following positive results. For the unicast model we give algorithms that are (1 + e)-speed 
0(i)-competitive in both the single machine and multiple machine settings. In the broadcast model we 
give an algorithm for similar-sized pages that is (2 + e)-speed 0(^)-competitive. For arbitrary page 

On ■ sizes we give an algorithm that is (4 + e)-speed 0(^)-competitive. 
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1 Introduction 

Scheduling requests (or job^j) that anive onhne is a fundamental problem faced by many systems and 
consequently there is a vast literature on this topic. A variety of models and performance metrics are studied 
in order to capture the requirements of a system. In this work, we consider a recently suggested performance 
measure called delay factor HCEGKOSII when each request has an arrival time (also referred to as release 
time) and a deadline. We consider both the traditional setting where requests are independent, and also the 
more recent setting of broadcast scheduling when different requests may ask for the same page (or data) and 
can be simultaneously satisfied by a single transmission of the page. We first describe the traditional setting, 
which we refer to as the unicast setting, to illustrate the definitions and and then describe the extension to 
the broadcast setting. 

We assume that requests aiTive online. The arrival time a,, the deadline d/, and the processing time li of 
a request Jj are known only when i arrives. We refer to the quantity Si = {di — Oj) as the slack of request 
i. There may be a single machine or m identical machines available to process the requests. Consider an 
online scheduling algorithm A. Let fi denote the completion time or finish time of Jj under A. Then the 
delay factor of A on a given sequence of requests a is defined as a^{a) = max{l, max j^g^r j^'I^' j. In 
other words a"^ measures the factor by which A has delayed jobs in proportion to their slack. The goal 
of the scheduler is to minimize the (maximum) delay factor. We consider worst-case competitive analysis. 
An online algorithm A is r-competitive if for all request sequences a, a^{a) < ra*{a) where a* (a) is 
the delay factor of an optimal offline algorithm. Delay factor generalizes the previously studied maximum 
stretch measure introduced by Bender, Chakraborty and Muthukrishnan ||BCM98i The maximum stretch of 
a schedule A is raaxj^^a-ifi — o,i)/£i where £i is the length or processing time of Jj. By setting dj = Oj + £j 
for each request Jj it can be seen that delay factor generalizes maximum stretch. 

In the broadcast setting, multiple requests can be satisfied by the same transmission. This model is 
inspired by a number of recent applications — see IIBNBNS021 IAF991 IAFZ951 IBMOOl for the motivating 
applications and the growing literature on this topic. More formally, there are n distinct pages or pieces 
of data that are available in the system, and clients can request a specific page at any time. This is called 
the pull-model since the clients initiate the request and we focus on this model in this paper (in the push- 
model the server transmits the pages according to some frequency). Multiple outstanding requests for the 
same page are satisfied by a single transmission of the page. We use J(p^i) to denote i'th request for a 
page p G {1,2,..., n}. We let a(p^i) and d/p j) denote the arrival time and deadline of the request Jip^i). 
The finish time /(p,,) of a request J(p^i) is defined to be the earliest time after a(^p^i^ when the page p is 
sequentially transmitted by the scheduler. Note that multiple requests for the same page can have the same 
finish time. The delay factor a^ for an algorithm A over a sequence of requests a is now defined as 
max{l,max.„i^p^ fr')-"(p>') \ 

Motivation: There are a variety of metrics in the scheduling literature and some of the well-known and 
widely used ones are makespan and average response time (or flowtime). More recently, other metrics such 
as maximum and average stretch, which measure the waiting time in proportion to the size of a request, 
have been proposed IIBCM98I IKS W99[ |Sga98p ; these measures were motivated by applications in databases 



and web server systems. Related metrics include Lp norms of response times and stretch IIBP031 IAA031 
ICGKK04ll for 1 < p < oo. In a variety of applications such as real-time systems and data gathering systems, 
requests have deadlines by which they desire to be fulfilled. In real-time systems, a hard deadline implies 
that it cannot be missed, while a soft deadline implies some flexibility in violating it. In online settings it is 
difficult to respect hard deadlines. Previous work has addressed hard deadlines by either considering periodic 
tasks or other restrictions HBBOSII . or by focusing on maximizing throughput (the number of jobs completed 



'in this paper we use requests instead of jobs since we also address the broadcast scheduling problem where a request for a page 
is more appropriate terminology than a job. 



by their deadline) IIKC041 ICLTWOl IZFC+06ll . It was recendy suggested by Chang et al. IICEGK08II that 
delay factor is a useful and natural relaxation to consider in situations with soft deadlines where we desire 
all requests to be satisfied. In addition, as we mentioned already, delay factor generalizes maximum stretch 
which has been previously motivated and studied in IIBCM981 IBMR02I . 

Results: We give the first results for online scheduling for minimizing delay factor in both the unicast and 
broadcast settings. Throughout we assume that requests are allowed to be preempted if they have varying 
processing times. We first prove strong lower bounds on online competitiveness. 

• For unicast setting no online algorithm is A'^-^/2-competitive where A is the ratio between the maxi- 
mum and minimum slacks. 

• For broadcast scheduling with n unit-sized pages there is no n/4-competitive algorithm. 

We resort to resource augmentation analysis, introduced by of Kalyanasundaram and Pruhs HKPOOII . to 
overcome the above lower bounds. In this analysis the online algorithm is given faster machines than the 
optimal offline algorithm. For s > 1, an algorithm A is s-speed r-competitive if A when given s-speed 
machine(s) achieves a competitive ratio of r. We prove the following. 



• 



For unicast setting, for any e G (0, 1], there are (1 + e)-speed 0(l/e)-competitive algorithms in both 
single and multiple machine cases. Moreover, the algorithm for the multiple machine case immedi- 
ately dispatches an arriving request to a machine and is non-migratory. 

• For broadcast setting, for any e G (0, 1], there is a (2 + e)-speed 0(l/e^)-competitive algorithm for 
unit-sized (or similar sized) pages. If pages can have varying length, then for any e G (0, 1], there is a 
(4 + e)-speed 0(l/e^)-competitive algorithm. 

Our results for the unicast setting are related to, and borrow ideas from, previous work on minimizing 
Lp norms of response time and stretch IIBP03II in the single machine and parallel machine settings IIAA03I 
|CGKK04|. 

Our main result is for broadcast scheduling. Broadcast scheduling has posed considerable difficulties 
for algorithm design. In fact most of the known results are for the offline setting MKPVOOI IEH021 lGkKW041 
IGKPS061 IBCS06I IBCKN051 and several of these use resource augmentation! The difficulty in broadcast 
scheduling arises from the fact that the online algorithm may transmit a page multiple times to satisfy 
distinct requests for the same page, while the offline optimum, which knows the sequence in advance, 
can save work by gathering them into a single transmission. Online algorithms that maximize throughput 
IIKC041 ICLTW04I IZFC+06[ ICDJ+06l get around this by eliminating requests. Few positive results are 
known in the online setting where all requests need to be scheduled i BMOOl lEP031IEP()5l and the analysis in 
all of these is quite non-trivial. In contrast, our algorithm and analysis are direct and explicitly demonstrate 
the value of making requests wait for some duration so as to take advantage of potential future requests 
for the same page. We hope this idea can be further exploited in other broadcast scheduling contexts. We 
mention that even in the offline setting, only an LP-based 2-speed algorithm is known for delay factor with 
unit-sized pages IICEGK08I . 

Related Work: We refer the reader to the survey on online scheduling by Pruhs, Sgall and Tomg IIPST04II 
for a comprehensive overview of results and algorithms (see also IIPru07ll ). For jobs with deadlines, the 
well-known earliest-deadline-first (EDF) algorithm can be used in the offline setting to check if all the jobs 
can be completed before their deadline. A substantial amount of literature exists in the real-time systems 
community in understanding and characterizing restrictions on the job sequence that allow for schedulability 
of jobs with deadlines when they anive online or periodically. Previous work on soft deadlines is also 
concerned with characterizing inputs that allow for bounded tardiness. We refer the reader to IILLS07II for 
the extensive literature scheduhng issues in real-time systems. 



Closely related to our work is that on max stretch ||BCM98i where it is shown that no online algorithm 
is 0{P^'^) competitive even in the preemptive setting where P is ratio of the largest job size to the smallest 
job size. IIBCM98II also gives an 0{\fP) competitive algorithm which was further refined in IIBMR02I . Re- 
source augmentation analysis for Lp norms of response time and stretch from the work of Bansal and Pruhs 
IIBP03I implicitly shows that the shortest job first (SJF) algorithm is a (1 + e)-speed 0(l/e)-competitive 
algorithm for max stretch. Our work shows that this analysis can be generaUzed for the delay factor metric. 
For multiple processors our analysis is inspired by the ideas from IIAA03I ICGKK04II . 

Broadcast scheduling has seen a substantial amount of research in recent years; apart from the work that 
we have already cited we refer the reader to IICK06IIKK04II . the recent paper of Chang et al. IICEGK08II . 
and the surveys IIPST04I fPruOVII for several pointers to known results. Our work on delay factor is inspired 
by IICEGK08I . As we mentioned already, a good amount of the work on broadcast scheduling has been on 
offline algorithms including NP-hardness results and approximation algorithms (often with resource aug- 
mentation). For delay factor there is a 2-speed optimal algorithm in the offline setting and it is also known 
that unless P = NP there is no 2 — e approximation IICEGK08I . In the online setting the following re- 
sults are known. For maximum response time, it is shown in MBMOO ', 'CEGK081 that first-in-first-out (FIFO) 
is 2-competitive. For average response time, Edmonds and Pruhs |EP03| give a (4 + e)-speed 0(l/e)- 
competitive algorithm; their algorithm is an indirect reduction to a complicated algorithm of Edmonds 
MEdmOOII for non-clairvoyant scheduling. They also show in IIEP05I that longest-wait-first (LWF) is a 6- 
speed 0(l)-competitive algorithm for average response time. Constant competitive online algorithms for 



maximizing throughput IIKC04[ ICLTWOl IZFC+06[ ICD J+061 for unit-sized pages. 



We describe our results for the unicast setting in Section|2]and for the broadcast settings in Section[3] 

Notation: We let Si = di — ai denote the slack of Jj in the unicast setting. When requests have varying 
processing times (or lengths) we use £j to denote the length of Jj. We assume without loss of generality that 
Si > li. In the broadcast setting, (p, i) denotes the i'th request for page p. We assume that the requests for 
a page are ordered by time and hence a(p j) < a(pj) for i < j. In both settings we use A to denote the ratio 
of maximum slack to the minimum slack in a given request sequence. 

2 Unicast Scheduling 

In this section we address the unicast case where requests are independent. We may thus view requests as 
jobs although we stick with the use of requests. For a request Jj, recall that Oj, dj, £«, /j denote the amval 
time, deadline, length, and finish time respectively. An instance with all £j = 1 (or more generally the 
processing times are the same) is referred to as a unit-time instance. It is easy to see that preemption does 
not help much for unit-sized instances. Assuming that the processing times are integer valued then in the 
single machine setting one can reduce an instance with varying processing time to an instance with unit- 
times as follows. Replace Jj, with length £i, by ii unit-sized requests with the same arrival and deadline as 
that of Jj. 

As we had remarked earlier, scheduling to minimize the maximum stretch is a special case of scheduling 
to minimize the maximum delay factor. In IIBCM98II a lower bound of P^'^ is shown for online maximum 
stretch on a 1-speed machine where P is the ratio of the maximum processing time to the minimum process- 
ing time. They show that this bounds holds even when P is known to the algorithm. This implies a lower 
bound of A^' "^ for minimizing the maximum delay factor. Here we improve the lower bound for maximum 
stretch to P^-^/2 when the online algorithm is not aware of P. A proof can be found in the appendix. 

p. 4 

Theorem 2.1. There is no 1-speed ^ — competitive algorithm for online maximum stretch when P is not 
known in advance to the algorithm. 



Corollary 2.2. There is no 1-speed -^ — competitive algorithm for delay factor scheduling when A is not 
known in advance with unit-time requests. 

In the next two subsections we show that with (1 + e) resource augmentation simple algorithms achieve 
an 0(l/e) competitive ratio. 

2.1 Single Machine Scheduling 

We analyze the simple shortest-slack-first (SSF) algorithm which at any time t schedules the request with 
the shortest slack. 



Algorithm: SSF 

At any time t schedule the request with with the minimum stretch which has 
not been satisfied. 



Theorem 2.3. Thealgorithm SSF is {l + e)-speed {-) -competitive for minimizing the maximum delay factor 
in unicast scheduling. 

Proof. Consider an arbitrary request sequence a and let a be the maximum delay factor achieved by SSF 
on (T. If a = 1 there is nothing to prove, so assume that a > 1. Let Jj be the request that witnesses a, that 
is a = (/j — Qi)/ Si. Note that SSF does not process any request with slack more than Si in the interval 
[tti, fi]. Let t be the largest value less than or equal to Oj such that SSF processed only requests with slack 
at most Si in the interval [t, fi]. It follows that SSF had no requests with slack < Si just before t. The 
total work that SSF processed in [t, fi] on requests with slack less than equal to Si is (1 + e)(/j — t) and 
all these requests arrive in the interval [t, fi]. An optimal offline algorithm with 1-speed can do total work 
of at most {fi — t) in the interval [t, fi] and hence the earliest time by which it can finish these requests is 
fi + e{fi — t) > fi -\- e{fi — Qi). Since all these requests have slack at most Si and have arrived before fi, 
it follows that a* > e{fi — ai)/Si where a* is the maximum delay factor of the optimal offline algorithm 
with 1-speed machine. Therefore, we have that a/a* < 1/e. D 

Remark 2.4. For unit-time requests, the algorithm that non-preemptively schedules requests with the short- 
est slack is a {1 -\- e)-speed — competitive for maximum delay factor 

2.2 Multiple Machine Scheduling 

We now consider delay factor scheduling when there are m machines. To adapt SSF to this setting we take 
intuition from previous work on minimizing Lp norms of flow time and stretch IIBP031 IAA03I ICGKK04II . 
We develop an algorithm that immediately dispatches an arriving request to a machine, and further does not 
migrate an assigned request to a different machine once it is assigned. Each machine essentially runs the 
single machine SSF algorithm and thus the only remaining ingredient to describe is the dispatching rule. 
For this purpose the algorithm groups requests into classes based on their slack. A request Jj is said to be 
in class k if Si G [2^, 2*^+^). The algorithm maintains the total processing time of requests (referred to as 
volume) that have been assigned to machine x in each class k. Let U^f.{t) denote the total processing time 
of requests assigned to machine x by time t of class k. With this notation, the algorithm SSF-ID (for SSF 
with immediate dispatch) can be described. 



Algorithm: SSF-ID 

When a new request Jj of class k arrives at time t, assign it to a machine x 
where U^^{t) = min^ U^i.{t). 

Use SSF on each machine separately. 



The rest of this section is devoted to the proof of the following theorem. 

Theorem 2.5. SSF-ID is a (1 + e)-speed 0{-)-competitive algorithm for online delay factor scheduling on 
m machines. 

We need a fair amount of notation. For each time t, machine x, and class k we define several quantities. 
For example U^j^{t) is the total volume assigned to machine x in class k by time t. We use the predicate 
"< k" to indicate classes 1 to k. Thus U^^{t) is the total volume assigned to machine x in classes 1 to k. We 
let R^j^{t) to denote the remaining processing time on machine x at time t and let P^ki^) denote the total 
volume that x has finished on requests in class k by time t. Note that PZki^) — U=ki^) ~ ^=ki^)- ^^^ these 
quantities refer to the algorithm SSF-ID. We use V2j,{t) and V=k{t) to denote the remaining volume of 
requests in class k in an optimal offline algorithm with speed 1 and SSF-ID with speed (1 + e), respectively. 
Observe that V=k{t) = J2x ^=ki^)- "^^^ quantities V<^(i) and V<k(t) are defined analogously. 

The algorithm SSF-ID balances the amount of processing time for requests with similar slack. Note that 
the assignment of requests is not based on the current volume of unfinished requests on the machines, rather 
the assignment is based on the volume of requests that were assigned in the past to different machines. We 
begin our proof by showing that the volume of processing time of requests less than or equal to some slack 
class is almost the same on the different machines at any time. Several of these lemmas are essentially the 
same as in IIAA03I . 

Observation 1. For any time t and two machines x and y, \U!^j^{t) — f/^^(t)| < 2 ~^^. This also implies 

that\U^,{t)-UUt)\<'2'^'- 

Proof. The first inequality holds since all of the requests of class k are of size < 2^^^. The second inequality 
follows easily form the first. D 

Proofs of the next two lemmas can be found in the appendix. 

Lemma 2.6. Consider any two machines x and y. The difference in volume of requests that have already 
have been processed is bounded as |P<^(t) — P^^(t)| < 2^^"^. 

Lemma 2.7. At any time t the difference between the residual volume of requests that needs to be processed, 
on any two different machines, x and y is bounded as |i?^^(i) — i?^^,(t)| < 2^~^^. 

Corollary 2.8. At any time t, 1^<fc(t) > V<k{t) - m2^+^. 

Now we get to the proof of the upper bound on SSF-ID, when given (1 + e)-speed, in a similar fashion 
to the single machine case. Consider an arbitrary request sequence a and let Jj be the request that witnesses 
the delay factor a of SSF-ID on a. Let k be the class of Jj. Therefore a = {fi — ai)/ Si. Also, let x be the 
machine on which Jj was processed by SSF-ID. We use a* to denote the delay factor of some fixed optimal 
offline algorithm that uses m machines of speed 1. 

Let t be the last time before at when machine x processed a request of class > k. Note that t < ai since 
x does not process any request of class > A: in the interval [ai, fi]. At time t we know by Corollary I2.8l that 



V"<^.(t) > V<k{t) - m2^+'^. If fi <ai + 2^+^ then SSF-ID achieves a competitive ratio of 16 since Ji is in 
class k. Thus we will assume from now on that fi> ai + 2'^"''^. 

In the interval / = [t, fi), SSF-ID completes a total volume of of (1 + e)(/j — t) on machine x. Using 
Lemma l2!6l any other machine y also processes a volume of (1 + e)(/j — t) — 2'^+'^ in /. Thus the total 
volume processed by SSF-ID during / in requests of classes < /c is at least m{\ + e)(/j — t) — m2^~^^. 
During /, the optimal algorithm finishes at most m{fi — t) volume in classes < k. Combining this with 
Corollary 12.81 we see that 

V^kifi) > V<k{t)-m2^+^ + m{l + e){fi-t)-m2^+^ 

> V<k{t) + m(l + e)(/, -t)- m2'=+4 > €m{fi - t). 

In the penultimate inequality we use the fact that fi — t>fi — ai> 2'^+^. Without loss of generality assume 
that no requests arrive exactly at fi. Therefore V^<fc(/j) is the total volume of requests in classes 1 to k that 
the optimal algorithm has left to finish at time fi and all these requests have arrived before fi. The earliest 
time that the optimal algorithm can finish all these requests is by fi + e(/j — t) and therefore it follows that 
a* > e{fi - t)/2*^+i. Since a < {fi - a.i)/2^ and t < Ui, it follows that a < 2a* /e. 
Thus a < max{16, 2a* /e} which finishes the proof of Theorem 12.5 1 

3 Broadcast Scheduling 

We now move our attention to the broadcast model where multiple requests can be satisfied by the trans- 
mission of a single page. Most of the literature in broadcast scheduling is concerned with the case where 
all pages have the same size which is assumed to be unit. A notable exception is the work of Edmonds and 
Pruhs IIEP03II . Here we consider both the unit-sized as well as arbitrary sized pages. 

We start by showing that no 1-speed online algorithm can be (?i/4) -competitive for delay factor where 
n is the total number of unit-sized pages. We then show in Section lTTl that there is a (2 + e)-speed 0(l/e^)- 
competitive algorithm for unit-sized pages. We prove this for the single machine setting and it readily 
extends to the multiple machine case. Finally, we extend our algorithm and analysis to the case of different 
page sizes to obtain a (4 + e)-speed 0(l/e^)-competitive algorithm in Section ll!2l We believe that this can 
be extended to the multiple machine setting but leave it for future work. 

Theorem 3.1. Every 1-speed online algorithm for broadcast scheduling to minimize the maximum delay 
factor is Vt{n)-competitive where n is number of unit-sized pages. 

The proof of Theorem 13. II can be found in the appendix. 

3.1 A Competitive Algorithm for Unit-sized Pages 

We now develop an online algorithm, for unit-sized pages, that is competitive given extra speed. It is easy 
to check that unlike in the unicast setting, simple algorithms such as SSF fail to be constant competitive in 
the broadcast setting even with extra speed. The reason for this is that any simple algorithm can be made to 
do an arbitrary amount of "extra" work by repeatedly requesting the same page while the adversary can wait 
and finish all these requests with a single transmission. We use this intuition to develop a variant of SSF that 
adaptively introduces waiting time for requests. The algorithm uses a single real-valued parameter c < 1 to 
control the waiting period. The algorithm SSF-W (SSF with waiting) is formally defined below. We note 
that the algorithm is non-preemptive in that a request once scheduled is not preempted. As we mentioned 
earlier, for unit-sized requests, preemption is not very helpful. The algorithm keeps track of the maximum 
delay factor it has seen so far, at, and this depends on requests that are yet to be completed (we set oq = 1). 



The important feature of the algorithm is that it considers requests for scheduling only after they have waited 
sufficiently long when compared to their adaptive slack. 



Algorithm: SSF-W 

• Let at be the maximum delay factor SSF-W has at time t. 



• 



At time t, let Q{t) = {J[p^i) \ J(p,i) has not been satisfied and ^ ''';'•' > cat5(p j)}. 

If the machine is free at t, schedule the request in Q{t) with the smallest slack non-preemptively. 



We now analyze SSF-W when it is given a (2 + e)-speed machine. Let a be an arbitrary sequence of 
requests. Consider the^zr^f time t where SSF-W achieves the maximum delay factor a^^^"^. At time t, 
SSF-W must have finished a request Jop^k) which caused SSF-W to have this delay factor. Hence, SSF-W 
has a maximum delay factor of {f[p^k)~0'{p,k))/S(p^k) where f\p^k) is the time SSF-W satisfies request J(p^k)- 
We let OPT denote some fixed offline optimum algorithm and let a* denote the optimum delay factor. 

We now prove the most interesting difference between unicast and broadcast scheduling. The following 
lemma shows that forcing a request to wait in the queue, for a small period of time, can guarantee that our 
algorithm is satisfying as many requests as OPT by a single broadcast unless OPT has a similar delay factor. 

Since J(p^k) defines a^^*^"^, we observe that from time t' = a(p,fc) + c{f(^p^k) ~ 0{p,fc))> the request 
J{p, k) is ready to be scheduled and hence the algorithm is continuously busy in the interval / = [t' , f(p^k)\ 
processing requests of slack no more than that of J(p^k)- 

Lemma 3.2. Consider the interval I = [t' , fu^k))- Suppose two distinct requests Jixj) ond J(x,i)for the 
same page x were satisfied by SSF-W during I at different times. If OPT satisfies both of these requests by 
a single broadcast then a < tO;*. 

Proof. Without loss of generality assume that i > j; therefore a(xj) ^ 0'(x,i)- Request J[x,i) must have 
aiTived during /, otherwise SSF-W would have satisfied Jix,i) when it satisfied J(x,j)- We observe that 

Since J(x,j) was scheduled after t', it follows that SSF-W would have made it wait at least cat'S(^x,j) 
which implies that 

f(x,j) > a(xj) + (^0!t'S(^x,j)- 

Note that SSF-W satisfies J(x,i) by a separate broadcast from J[x,j) which implies that a(^x,i) > f{x,j)- 
However, OPT satisfies both requests by the same transmission which implies that OPT finishes J(x,j) ^^ 
earlier than a(^ ,/). Therefore the delay factor of OPT is at least the delay factor for J[x,j) m OPT which 
implies that 

0'{x,i) - ^{x,j) J{x,j) - O'ixJ) Cat'D(^^j-j 5^ SSF-W 



D 



S{x,j) S(^x,j) S(^x,j) 



Note that previous lemma holds for any two requests scheduled by SSF-W during interval / regai^dless 
of when OPT schedules them, perhaps even after /(p,fc). 

Lemma 3.3. Consider the interval I = [t' , ftp^k)]- ^'^J request which SSF-W scheduled during I must have 
arrived after time a^^p^^) - (1 - c){f(p,k) - a(p±)). 



Proof. For sake of contradiction, assume that a request J(x,j) scheduled by SSF-W on the interval / has 
arrival time less than a(p fc) — (1 — c){f(^p^i.^ — a(pj^.)). Since SSF-W finishes this request during /, f[x,j) ^ 
'^(p,k) + (^{f{p,k) ~ ^(p,fe))- Also, as we observed before, all requests scheduled during / by SSF-W have 
slack no more than that of J(p^k) which implies that Si^^j) ^ ^{p,k)- However this implies that the delay 
factor of J[x,j) is at least 

ifjxj) - Q-(xj)) ^ Q(p,fc) + c(/(p,fc) - «(p,fc)) - ("(p.fc) - (1 - c)(/(p,fc) - a{p,k))) 

S{x,j) S(^x,j) 

^ (/(p,fc) -Q(p,fc)) ^ {f{p,k) -a{p,k)) ^ Q,ssF-w 
S{x,j) Sf^p^k) 

This is a contradiction to the fact that J{p^k) is the first request that witnessed the maximum delay factor 
of SSF-W. ' D 

Now we are ready to prove the competitiveness of SSF-W. 
Lemma 3.4. The algorithm SSF-W when given a {2+e)-speed machines satisfies a^^^'^ < max{^, ^_^_^ }a*. 
Proof. The number of broadcasts which SSF-W transmits during the interval / = [t', fip^k)] is 

(2 + e)(/(p,fc) - t') > (2 + e)(l - c)(/(p,fc) - a(^p^,)). 

From Lemma [331 all the requests processed during / have arrived no earlier than a(p fc) — (1 — c){f(pk) — 
<^(p,k))- Also, each of these requests has slack no more than S(^p±y We restrict attention to the requests 
satisfied by SSF-W during /. We consider two cases 

First, if there are two requests for the same page that SSF-W satisfies via distinct broadcasts but OPT 
satisfies using one broadcast, then by Lemma 1X2] a^^^'^ < -^a* and we are done. 

Second, we assume that OPT does not merge two requests for the same page whenever SSF-W does not 
do so. It follows that OPT also has to broadcast (2 + e)(l — c){f(^p^i^) — a(p,fc)) pages to satisfy the requests 
that SSF-W did during I. Since these requests anived no earlier than atp^i^\ — (1 — c){fipk\ — a/p^O, OPT, 
which has a 1-speed machine, can finish them at the earliest by 

(2 + e)(l - c)(/(p_fc) - a(p,fc)) + a(p_fc) - (1 - c)(/(p_fc) - ai^p^^-^) > /(p^^) + (e - c - ce){f{j,^k) - «(p,fc))- 
Since each of these requests has slack at most Sip^k\ and arrived no later than f(p^k)' we have that 

a* > if(p,k) + i^^- c- ce){f(^p^k) - 0'(p,k)) - f(p,k))/S(p^k) 

> {e-c-ce){f(_p^k)-a{p,k))/S(p,k) > (e - c - ce)a^^''"^. 

D 
The previous lemma yields the following theorem. 

Theorem 3.5. With c = e/2, SSF-W is a {2 + e)-speed 0{-^)-competitive algorithm for minimizing the 
maximum delay factor in broadcast scheduling with unit-sized pages. 



It may appear that SSF-W needs knowledge of e. However, another way to interpret Lemma [33] is that 
for any fixed constant c, SSF-W with parameter c is constant competitive in all settings where its machine is 
at least (2 + 2^/c) times the speed of the optimal algorithm. Of course, it would be ideal to have an algorithm 
scales with e without any knowledge of e. We leave the existence of such an algorithm for future work. 

Now consider having m machines where we have (1 + e)-speed. Since we are using unit time requests, 
this is analogous to OPT having one ?n-speed machine and SSF-W having a {m{2 + e))-speed machine. 
Thus, one can extend the above analysis to the multiple machine setting with unit-sized pages in a straight 
forward fashion. 



3.2 Varying Page Sizes 

In this section we generalize our algorithm for unit-sized pages to the setting where each page has potentially 
a different page size. We let Ip denote the length of page p. In this setting we allow preemption of transmis- 
sions. Suppose the transmission of a page p is started at time ti and ends at time t^, p may be preempted 
for other transmissions and hence t2 — t\ > p. A request for a page p is satisfied by the transmission of p 
during the interval [ti, t2] only if the request arrives before ii. It is possible that the transmission of a page 
p is abandoned and restarted due to the arrival of a new request for p with a smaller slack. This may lead 
to further wasted work by the algorithm and increases the complexity of the analysis. Here we show that a 
natural adaptation of SSF-W is competitive even in this more general setting if it is given (4 + e)-speed. 

We outline the details of modifications to SSF-W. As before, at any time t, the algorithm considers 
broadcasting a request J(p,i) if g ^^''' > catS(^p^i^ ; these are requests that have waited long enough. Among 
these requests, the one with the smallest slack is scheduled. Note that the waiting is only for requests that 
have not yet been started; any request that has already started transmission is available to be scheduled. The 
algorithm breaks ties ai^bitrarily, yet ensures that if a request J(p,fc) is started before a request J(p' j) then 
J(p.k) will be finished before request J[p'.j)- Note that the algorithm may preempt a request J[p^i) by another 
request J(^p,k) for the same page p even though i < A; if ^(p ,fc) < >S'(p j) . In this case the transmission of J[p^i) 
is effectively abandoned. Note that transmission of a page p may be repeatedly abandoned. 

We now analyze the algorithm assuming that it has a (4 + e)-speed advantage over the optimal offline 
algorithm. The extra factor in speed is needed in our analysis to handle the extra wasted work due to 
potential retransmission of a page p after a large portion of it has already been transmitted. As before, let 
(7 be a sequence of requests and let t be the^^r^? time SSF-W achieves the maximum delay factor a^^^"^. 
At time t, it must be the case that a request J{p^k) was finished which caused SSF-W to have his maximum 
delay factor. Hence, SSF-W has a maximum delay factor of if{p^k) ~ o.{p,k)/ S[p^k)) where f[p^k) is the time 
SSF-W satisfied request J[p^k)- 

As with the case with unit time requests, at time t' = aip^k) + '^{f(p,k) ~ ^(p,k)) the request Jtp^k) is ready 
to be scheduled and the algorithm is busy on the interval I = [t', f(p^k)] processing requests of slack at most 

We say that a request J(p,i) is started at time t if t is the first time at which the algorithm picked J[p^i) 
to transmit its page. Multiple requests may be waiting for the same page p but only the request with the 
smallest slack that is picked by the algorithm is said to be started. Thus a request may be satisfied although 
it is technically not started. Also, a request J{p,i) that is started may be abandoned by the start of another 
request for the same page. 

The lemma below is analogous to Lemma [l!2l but requires a more careful statement since requests may 
now be started and abandoned. 

Lemma 3.6. Consider two distinct requests J(x,j) ^'^'^ J(x,i)fa^ ^'^^ same page x where i > j such that they 
are both satisfied by OPT via the same transmission. If SSF-W starts Jtxj) in [t' , /(p,A:)] before the arrival 
ofJ(x,i), then a^^^-"^ < ^a*. 

Observe that the request J[x,j) ni^y be satisfied together with J[x,i) even though it starts before the arrival 

of -/(aj.i)- 

Proof. As before, Uf > ''^•^'"'g'""'''''''^ > ca^^^'^ since J(p ^j is still alive at t'. Since J(^ ,,) is started after 
t', it follows that SSF-W would have made it wait at least catiSix,j)- Let t > t' be the start time of J(x,j)- 
Therefore t > a(^x,j) + cai'5(^, j). By our assumption, t < a(^x,i) ^rid therefore a(^x,i) > (^(x,j) + cat'S(^x,j)- 

Since OPT satisfies these two requests by the same transmission, the finish time of J[x,j) ^^ OPT is at 
least a(^x,i)- Therefore, 



^* y «(x,») a{x,j) ^ cat'S(^^j) ^ ^^^^ ^ ^2^ssF-w_ 
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The proof of the lemma below is very similar to that of Lemma [331 

Lemma 3.7. Consider the interval I = [i', /(p.fc)]- ^fiy request which is alive with slack < Stp^i^y but 
unsatisfied by SSF-W at time t' must have arrived after time aip^k) ~ (1 ~ c){f(p,k) ~ (^(p,k))- 

Now we are ready to prove the competitiveness of SSF-W. Although the outline of the proof is similar 
to that of Lemma 13.41 it requires more careful reasoning to handle the impact of abandoned transmissions 
of pages. Here is where we crucially rely on the speed of (4 + e). 

Lemma 3.8. The algorithm SSF-W when given a {A+e)-speed machine satisfies q^^^'^ < maxj^-, -t-^ — ^t}q^*- 

Proof. We consider the set of requests satisfied by SSF-W during the interval I = [t', f{p^k)]- AH of these 
requests have slack at most 5'(p fc), and from Lemma 1377] and the property of the algorithm, have arrived no 
earlier than a(^pk) — (1 — c)(/(pfc) — a(p fc)). Since SSF-W is busy throughout /, the volume of broadcasts 
it transmits during / is (4 + e)(/(p,fc) - t') > (4 + e)(l - c){f(^p^k) - a{p,fc))- 

We now argue that either Lemma 1331 applies in which case a^^'''"^ < a* jc? , or OPT has to transmit a 
comparable amount of volume to that of SSF-W. 

Fix a page x and consider the transmissions for x that SSF-W does during /. Let J(x,ii)i J{x,i2)^ • • • > J{x,ir) 
be distinct requests for x which cause these transmissions. Amongst these, only J{x,ii) m^Y have started be- 
fore t', the rest start during /. Note that we are not claiming that these transmissions ai^e satisfied separately; 
some of them may be preempted and then satisfied together. Observe that if J[x,ih,) starts at some time t then 
it implies that no request J(^, j ,) for /i' > /i has arrived by time t. Therefore by Lemma [3^ if OPT satisfies 
any two of these requests that SSF-W started in / by the same transmission, a^^^'^ < a* jc? and we are 
done. 

Otherwise, OPT satisfies each of J[x,i2)^ ■ ■ ■ ■> '^{x,ir) by separate transmissions. (If J[x,ii) was started 
by SSF-W before t', OPT could satisfy J[x,ii) ^nd J[x,i2) together and we would not be able to invoke 
Lemma 13.61 ). Therefore if r > 2 then the total volume of transmissions that OPT does to satisfy these 
requests for page x is at least (r — l)£x while SSF-W does at most rl^- lir = 1 then both OPT and SSF-W 
transmit page x once for its entire page length. In either case, the total volume of transmissions that OPT 
does is at least half those of SSF-W. Since x was arbitrary, it follows that the total number of transmissions 
that OPT does to satisfy requests that SSF-W satisfies during / is at least ^(4 + e)(l — c)(/(p ;.) — a/p fcO. 

From Lemma [3771 all the requests that SSF-W processes during / arrived no earlier than a(p^fc) — (1 — 
c)(/(p,fc) ~ '^(p,fc))- Since OPT has a 1-speed machine, it follows that OPT can finish these requests only by 
time 

-(4 + e)(l-c)(/(p^fc) - a^p,k)) + a{p,k) - (l-c)(/(p,fe) -^{M)) - f{p,k)+ ^^e- c- ce){f^p^k) - a^p^k))- 
Since each of these requests have slack at most 5'(p fc) and arrive no later than f(p^k)y 
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a* > i.f(p,k)+^{^-c-ce){f(^p^k)-a{p,k))-f{p,k))/S{p,k) > ^i.(^-c-ce){f(^p^k)-a{p,k))/S(p,k) > 2^^-^-' 



SSF-W 
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We thus obtain the following. 

Theorem 3.9. With c = e/2, SSF-W is a (4 + e)-speed 0{^)-competitive algorithm for minimizing the 
maximum delay factor in broadcast scheduling with arbitrary page sizes. 
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4 Concluding Remarks 

In this paper we have initiated the study of online algorithms for minimizing delay factor when requests have 
deadlines. Our main result is broadcast scheduling where the algorithm and analysis demonstrates the utility 
of making requests wait. We hope that this and related ideas are helpful in understanding other performance 
measures in the broadcast setting. Particularly, can 'waiting' combined with some known algorithm, like 
most requests first, be used to improve the current best known online algorithm for minimizing the average 
response time? Another interesting problem is whether there is a (1 + e)-speed 0(l)-competitive algorithm 
for delay factor. Our algorithm has a parameter that controls the waiting time. Is there an algorithm that 
avoids taking an explicit parameter and "learns" it along the way? 

Acknowledgments: We thank Samir Khuller for clarifications on previous work and for his encouragement. 
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A Omitted Proofs 



A. 1 Proof of Theorem | 

Proof. For sake of contradiction, assume that some algorithm that achieves a competitive ratio better than 

pA 
2 



P 4 

-^ exists. Now consider the following example. 



Type 1: At time let the cUent request a page with processing time and deadline P. This 
request has slack P. 

Type 2: At times P - P-^, P, P + P-^, ... , P^-^^ - P-^ let the chent request a page with 
processing time P-^ and a deadline P-^ time units after its arrival time. These requests have 
slack P-6. 

Consider time P^'^^. Assume that this is all of the requests which the client makes. The optimal solution 
schedules these requests in a first in first out fashion. The optimal schedule finishes request type 1 by its 
deadhne. The requests of type 2 then finish at P'^ time units after their deadline. Thus, the delay factor for 
the optimal schedule is 2P-^/P-^ = 2. 

The maximum ratio of maximum to minimum slack values seen so far is -^ = P"*. Thus, the maximum 
delay factor our algorithm can have is (P'^)-^/2 = P-^^/2. Consider having the request of type 1 still in 
the deterministic algorithms queue. At time P^^^, the algorithm has achieved a delay factor of at least 



13 



'.±0 T'Viiic fhj^ ol rT/~\fil-MT-vi hoc o r>r\t-Y^r\i^titi\Ti^ fof-i/~\ r\T ot looct ±1 



p = P . Thus, the algorithm has a competitive ratio of at least -^-g— , a contradiction. Therefore, at time 
pi. 16 j-j^g algorithm must have finished the request of type 1. Now, immediately after this time, requests of 
type 3 arrive. 

Type 3: Starting at time p^-^^ the client requests p^'^—p-^ unit processing time requests each 
with a deadline one time unit after their amval time. These requests amve one after another, 
each time unit. The slack of these requests is 1 . 

These are all of the requests which are sent. The optimal solution schedules the request of type 1 until 
time P-^, thus has P-^ processing time left to finish this request. Then the optimal solution schedules the 
type 2 and type 3 requests as they arrive, giving them a delay factor of 1. At time p^-^^ _|_ pi-2 _ p. 6 
the optimal solution schedules the request of type 1 to completion. Thus delay factor of this solution is 

^"y < 2P-2. 

Our algorithm must have scheduled the request of type 1 by time P^-^^ . Thus the last request it finishes 
is either of type 2 or type 3. If the request is of type 2 then this request must have waited for all requests 
of type 3 to finish along with its processing time, thus the delay factor is at least ^ pji^ > P'^- If the 
last request satisfied by the algorithm is of type 3, then this request must have waited for a request of type 2 
to finish, so the delay factor is at least P'^. In either case, the competitive ratio of the algorithm is at least 

D 



^77 = -^, a contradiction. 



A.2 Proof of Lemma |Ml 

Proof. Suppose the lemma is false. Then there is a first time to when P<;-(to) — ^<fc(^o) = 2^^'^ and small 
constant 5t > such that P<fc(to + St) - P^^{to + 6t) > 2'"+^. Let t' = to + 5t. For this to occur, x 
processes a request of class < k during the interval / = [ig , t'] while y processes a request of class > k. 
Since each machine uses SSF, it must be that y had no requests in classes < k during / which implies that 
Ul^{t') = P|fc(t')- Therefore, 

C/|,(t') = P'^,{t') < PUt') - 2^=+^ < uUt') - 2'+\ 
since P^j^{t') < U^f.{t'). However, this implies that 

Ul,{t')<UUt')-2'^+\ 
a contradiction to Observation [1] D 

A.3 Proof of Lemma |2J] 

Proof. Combining Observation [H Lemma |Z61 and the fact that R{t) = U{t) — P{t) by definition then, 

\Rlkit) - Rl,{t)\ < \UUt) - C/|,(t)| + |P|,(t) - P|,(t)| < 2^^\ 

D 
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A.4 Proof of Theorem O 

Proof. Let A be any online 1-speed algorithm. We consider the following adversary. At time 0, the ad- 
versary requests pages 1, . . . , ^, all which have a deadline of ^. Between time 1 and | the client requests 
whatever page the online algorithm A broadcasts immediately after that request is broadcast; this new re- 
quest also has a deadline of ^. It follows that at time t = § the online algorithm A has ^ requests for distinct 
pages in its queue. However, the adversary can finish all these requests by time ^. Then starting at time 
§ the adversary requests ^ new pages, say | + 1, . . . , n. These new pages are requested, one at each time 
step, in a cyclic fashion for -n? cycles. More formally, for i = 1, . . . , n/2, page ^ + i is requested at times 
j • (§) + i — 1 for J = 1, . . . , n. Each of these requests has a slack of one which means that their deadline 
is one unit after their arrival. The adversary can satisfy these requests with delay since it has no queue at 
any time; thus its maximum delay factor is 1. However, the online algorithm A has ^ requests in its queue 
at time ^; each of these has a slack of ^. We now argue that the delay factor of ^4 is 0(n). If the algorithm 
satisfies two slack 1 requests for the same page by a single transmission, then its delay factor is n/2; this 
follows since the requests for the same page are n/2 time units apart. Otherwise, the algorithm does not 
merge any requests for the same page and hence finishes the the last request by time n/2 + n? /2 + n/4. If 
the last request to be finished is a slack 1 request, then its delay factor is at least n/4 since the last slack 1 
requests is released at time n/2 + n^/2. If the last request to be finished is one of the requests with slack 
n/2, then its delay factor is at least n^/2/(n/2) = 0(n). D 
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